Confidence Scoring of Time Difference of Arrival Estimation for Speaker Localization with Microphone Arrays
Authors
Abstract
Microphone arrays can be used for passive acoustic source localization based on time difference of arrival (TDOA) estimates from microphone pairs. The most common method for TDOA estimation is the generalized cross-correlation (GCC) method, which is also used in this work. In an office room, environmental noise and reverberation complicate TDOA estimation and make robust sound source localization more difficult. For a stationary single-source scenario with real data, this publication presents a method that increases the percentage of correct TDOA estimates to 70.30%, compared with 61.80% for the standard method of estimating the TDOAs. Furthermore, we show that two properties of the GCC function can be used as confidence criteria for the TDOA estimates: the value of the maximum peak and the ratio between the and the peak in the GCC function are both appropriate indicators of the reliability of a TDOA estimate. When these confidence criteria are divided into intervals of increasing value, the interval with the highest values yields a very reliable percentage of correct TDOA estimates of about 95% for both criteria.
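To make the two confidence criteria concrete, here is a minimal sketch of a GCC-based TDOA estimate for one microphone pair. It is not the authors' implementation. The abstract does not say which GCC weighting is used, so the PHAT weighting is an assumption. The ratio is taken between the highest and the second-highest local maximum of the GCC function, which is also an assumption, because the corresponding phrase in the abstract is incomplete. The function names `gcc_phat` and `confidence_criteria` are hypothetical.

```python
import numpy as np


def gcc_phat(x1, x2, fs, max_tau=None):
    """Estimate the TDOA of x1 relative to x2 (seconds) via GCC.

    PHAT weighting is an assumption; the abstract only says "GCC".
    A positive TDOA means the signal reaches microphone 1 later.
    """
    n = len(x1) + len(x2)                 # zero-pad to avoid circular wrap-around
    X1 = np.fft.rfft(x1, n=n)
    X2 = np.fft.rfft(x2, n=n)
    cross = X1 * np.conj(X2)              # cross-power spectrum
    cross /= np.abs(cross) + 1e-12        # PHAT: keep the phase, drop the magnitude
    gcc = np.fft.irfft(cross, n=n)

    max_shift = n // 2 - 1
    if max_tau is not None:               # restrict to physically possible lags
        max_shift = max(1, min(int(round(fs * max_tau)), max_shift))
    # Reorder so that lags run from -max_shift to +max_shift.
    gcc = np.concatenate((gcc[-max_shift:], gcc[:max_shift + 1]))
    lags = np.arange(-max_shift, max_shift + 1)
    return lags[np.argmax(gcc)] / fs, lags, gcc


def confidence_criteria(gcc):
    """Return (max_peak, ratio) for a GCC function.

    ratio = highest peak / second-highest local maximum (assumed definition).
    """
    k = int(np.argmax(gcc))
    max_peak = float(gcc[k])
    inner = gcc[1:-1]
    is_peak = (inner > gcc[:-2]) & (inner >= gcc[2:])   # interior local maxima
    idx = np.flatnonzero(is_peak) + 1
    idx = idx[idx != k]                                 # exclude the main peak
    second = float(gcc[idx].max()) if idx.size else 0.0
    ratio = max_peak / second if second > 0 else float("inf")
    return max_peak, ratio


# Synthetic demo: white noise reaching microphone 1 five samples late.
rng = np.random.default_rng(0)
fs = 16000
s = rng.standard_normal(2048)
delay = 5
x2 = s
x1 = np.concatenate((np.zeros(delay), s[:-delay]))

tdoa, lags, gcc = gcc_phat(x1, x2, fs, max_tau=1e-3)
peak, ratio = confidence_criteria(gcc)
print(round(tdoa * fs), peak, ratio)
```

In a real room recording, a low peak value or a ratio close to 1 would flag a TDOA estimate as less trustworthy. Thresholds or interval boundaries for these criteria would have to be chosen from data and are not given here.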
Similar resources
Robust Speaker Localization through Ad (AWEPAT) Estim
Time delay of arrival (TDOA) estimation between signals input to two or more microphones plays an important role in speaker localization. Most methods employ a linear array of two or more microphones and use the generalized cross correlation method or eigenspace analysis (AEDA) methods. TDOA estimation with linear arrays, however, is highly sensitive to estimation errors when the signals arrive...
Reliability Measurement of Time Difference of Arrival Estimations for Multiple Sound Source Localization
Time Difference Of Arrival (TDOA) estimates are used for passive acoustic single sound source localization with microphone arrays. The technique of choice in most systems for TDOA estimation is the Generalized Cross-Correlation (GCC) method [1]. For a multi-sound source scenario, cross-correlation terms of the active sound source signals as well as noise and reverberation effects complicate the...
Approaches for Time Difference of Arrival Estimation in a Noisy and Reverberant Environment
Determining the spatial position of a speaker is attracting growing interest in video conference scenarios where automated camera steering and tracking are required. As a preliminary step for the localization, a microphone array can be used to extract the time difference of arrival (TDOA) of the speech signal. The direction of arrival of the speech signal is then determined by the relative time delay be...
CMSC 660 Project Solutions Optimization methods for Sound Source Localization using Microphone arrays
Microphone arrays are widely employed for applications like teleconferencing, high quality sound capture, speaker recognition/identification, acoustic surveillance, head aid devices, speech acquisition in automobile environments, etc. For all these applications, the benefits that a microphone array provides over a single microphone are twofold. First, using a microphone array we can localize a so...
Speaker Localization and Tracking in Mobile Robot Environment Using a Microphone Array?
In this paper, a method for speaker localization and tracking is proposed, based on Time Difference of Arrival estimation enhanced with the so-called tuned phase transform. The localization method is based on a pseudo-linear estimator, and a Y-shaped array for spatial sampling is proposed and compared to a square array. The tracking is realized with the Recursive Least-Squares algorithm. At the end, results re...